Microsatellite‐based analysis reveals Aedes aegypti populations in the Kingdom of Saudi Arabia result from colonization by both the ancestral African and the global domestic forms

Abstract The Aedes aegypti (Linnaeus, 1762) mosquito is the main vector of dengue, chikungunya and Zika and is well established today all over the world. The species comprises two forms: the ancestral form found throughout Africa and a global domestic form that spread to the rest of the tropics and subtropics. In Saudi Arabia, A. aegypti has been known in the southwest since 1956, and previous genetic studies clustered A. aegypti from Saudi Arabia with the global domestic form. The purpose of this study was to assess the genetic structure of A. aegypti in Saudi Arabia and determine their geographic origin. Genetic data for 17 microsatellites were collected for A. aegypti ranging from the southwestern highlands of Saudi Arabia on the border of Yemen to the north‐west in Madinah region as well as from Thailand and Uganda populations (as representatives of the ancestral African and global domestic forms, respectively). The low but significant level of genetic structuring in Saudi Arabia was consistent with long‐distance dispersal capability possibly through road connectivity and human activities, that is, passive dispersal. There are two main genetic groupings in Saudi Arabia, one of which clusters with the Ugandan population and the other with the Thailand population with many Saudi Arabian individuals having mixed ancestry. The hypothesis of genetic admixture of the ancestral African and global domestic forms in Saudi Arabia was supported by approximate Bayesian computational analyses. The extent of admixture varied across Saudi Arabia. African ancestry was highest in the highland area of the Jazan region followed by the lowland Jazan and Sahil regions. Conversely, the western (Makkah, Jeddah and Madinah) and Najran populations corresponded to the global domesticated form. Given potential differences between the forms in transmission capability, ecology and behaviour, the findings here should be taken into account in vector control efforts in Saudi Arabia.


| INTRODUC TI ON
Aedes aegypti (Diptera: Culicidae) is the primary vector of pathogens responsible for yellow fever, dengue, chikungunya and Zika (WHO, 2022).In Saudi Arabia, dengue fever cases have been reported in the country's southwestern regions, that is, Jeddah, Makkah, Madinah, Jazan and Sahil (Al-Azraqi et al., 2013;Altassan et al., 2019;Fakeeh & Zaki, 2003).Two cases of chikungunya were reported: one qRT-PCR-confirmed autochthonous case in Jeddah in 2011 and another IgG-seropositive case in Jazan in 2021 (Hakami et al., 2021;Hussain et al., 2013).However, there have been no reported cases of Zika to date.In 2013, the incidence rate of dengue cases increased to 21.71 per 100,000 persons/year and dropped to 10.03 in 2021 (Alhaeli et al., 2016;Health, 2019Health, , 2021)).In the Jazan region of southwestern Saudia Arabia, the total number of dengue cases was 4985 between 2005 and 2021, but there have been no reported cases in the nearby highland area (Health, 2021).A good understanding of population genetic structure, ecology as well as the genetic make-up of local A. aegypti is essential for understanding the temporal epidemiology and risk of arboviral diseases.
The southwestern islands of the Indian Ocean host the oldest A. aegypti populations and are the likely source of A. aegypti that colonized Africa and subsequently spread to the rest of the world (Soghigian et al., 2020).Aedes aegypti has been in Africa for ~85,000 years, where it accumulated genetic diversity (Soghigian et al., 2020).However, the spread of A. aegypti to the rest of the tropics is recent and characterized by a genetic bottleneck resulting in low genetic diversity (Powell & Tabachnick, 2013).The slave trade from Africa to the New World between 1500 and 1650 and commercial shipping between countries are responsible for the spread of A. aegypti from Africa (Powell et al., 2018).In the Arabian Peninsula, A. aegypti has been present since at least 1956 in Jeddah, Makkah and Jazan, Saudi Arabia (Mattingly & Knight, 1956).Today, A. aegypti is found in the south and southwest regions of Saudi Arabia, along the Red Sea (Alikhan et al., 2014), and recently was recorded in areas of the north and central regions of the country (Al-Rashidi, pers. comm.).The origin of the Saudi Arabian population is unknown.
There are widely considered to be two forms of A. aegypti: a dark forest-dwelling form A. aegypti formosus (Aaf) from sub-Saharan Africa: and the paler coloured domestic form A. aegypti aegypti (Aaa) that dispersed from West Africa using global trade routes, to the rest of the tropics and subtropics (Powell et al., 2018;Powell & Tabachnick, 2013;Rose et al., 2023).Aedes aegypti formosus is generally considered not to occur outside of Africa.In this respect, it is interesting to note that two forms of A. aegypti, a dark form and a pale form, have been reported in the Arabian Peninsula (Mattingly & Knight, 1956).We, therefore, hypothesize that A. aegypti in Saudi Arabia arises from two sources, the diaspora of the global domestic form and the direct migration of the A. aegypti formosus form from Africa.Direct migration from Africa seems highly plausible given that the southwest region of Saudi Arabia up to and including Makkah and Jeddah is considered part of the Afrotropical zoogeographic region (Holt et al., 2013;Lane & Crosskey, 2012).Further, there has been extensive historical trade over the last two to three thousand years using sea routes connecting the Arabian Peninsula and Africa, that primarily used Aden, a Yemeni port (Martin & Vigne, 1997).This provides a viable transportation route for A. aegypti formosus from Africa to Saudi Arabia.The presence of A. aegypti formosus in Saudi Arabia has been suggested previously (El-Badry & Al Ali, 2010), and the argument put forward that divergent mtDNA clades in Saudi Arabia support this (Khater et al., 2021).However, the presence of both these clades throughout Africa as well as other tropical countries (Bennett et al., 2016) means such conclusions cannot be drawn from mtDNA.
Unplanned urbanization is associated with the spread of A. aegypti and the diseases it transmits (Kolimenakis et al., 2021).In addition, Ugandan population and the other with the Thailand population with many Saudi Arabian individuals having mixed ancestry.The hypothesis of genetic admixture of the ancestral African and global domestic forms in Saudi Arabia was supported by approximate Bayesian computational analyses.The extent of admixture varied across Saudi Arabia.African ancestry was highest in the highland area of the Jazan region followed by the lowland Jazan and Sahil regions.Conversely, the western (Makkah, Jeddah and Madinah) and Najran populations corresponded to the global domesticated form.Given potential differences between the forms in transmission capability, ecology and behaviour, the findings here should be taken into account in vector control efforts in Saudi Arabia.

K E Y W O R D S
Aedes aegypti, Aedes formosus, Arabian Peninsula, genetic diversity, microsatellites, mosquitoes, population genetics, Saudi Arabia the widespread use of air-conditioning systems, the water containers of which provide common breeding sites for A. aegypti, results in a high abundance of adult A. aegypti inside houses (Ali et al., 2021;Khater et al., 2021).A. aegypti is abundant year-round with density reported to peak in April in Madinah (El-Badry & Al Ali, 2010), January to March in Makkah (Aziz et al., 2012), December and January in Jeddah (Mahyoub, 2015), and November and December in Jazan (Alahmed et al., 2010).
Landscape features such as highways, rivers and primary roads play an important role in the dispersal of A. aegypti (Regilme et al., 2021).Passive dispersal through human transportation (likely in the form of immature stages or eggs) has been reported in A. aegypti from Argentina (Maffey et al., 2020(Maffey et al., , 2022)), Sri Lanka (Fernando et al., 2020) and Southeast Asia (Hlaing et al., 2010;Huber et al., 2002).Due to the short flight distance (average lifetime dispersal <200 m), active dispersal in A. aegypti is unlikely to shape genetic structure at the countrywide level (Moore & Brown, 2022).
Passive dispersal was found to increase in urban areas (Maffey et al., 2022) and was weak in mountains or isolated geographical locations (Fernando et al., 2020).This potential for long-range dispersal constitutes a major threat, as undesired traits such as insecticide resistance or a pathogen could be introduced to new locations.
Microsatellite markers have been widely used to characterize genetic population structure in A. aegypti worldwide (Gloria-Soria et al., 2016).In Saudi Arabia, only two studies from Madinah and Jeddah have addressed the population dynamics of A. aegypti using the mitochondrial cytochrome-c-oxidase subunit-1 (CO1) and dehydrogenase subunit-4 (ND4) (Ali et al., 2016;Khater et al., 2021) gene regions, but no finer scale genetic studies (i.e., using microsatellites or SNPs) have been conducted yet.Similarly, a study of the gene flow dynamics in southern and southwestern Saudi Arabia (i.e., Jazan and Sahil) is still absent (Mashlawi et al., 2022).Understanding population connectivity in this species is critical for determining the efficacy of innovative mosquito control strategies, that is, genetically modified mosquitoes and Wolbachia releases.Besides, understanding the genetic composition of A. aegypti populations in Saudi Arabia is also important since the two forms may differ in disease risk (Dickson et al., 2014;Weetman et al., 2018).Within this context, in this study, we aim to assess the genetic structure, gene flow regime and number of A. aegypti genetic clusters present in Saudi Arabia and test the hypothesis that A. aegypti in Saudi Arabia has been formed by admixture between ancestral populations from Africa and the global domestic form.

| Study sites
Saudi Arabia, which lies in southwestern Asia, is the largest country in the Arabian Peninsula.Geographically, Saudi Arabia is divided into three distinct zones: the rain-fed highlands of the western and southwestern regions (Sarawat Mountains), the arid and extra-arid lands of the interior (Najd), and the coastal plain along the Red Sea in the west of Saudi Arabia (known as the Tihamah) that includes the east of the Hejaz and the Asir mountain range (Figure 1).
Most collections for this study came from the Tihamah and Hejaz regions.Six different regions, namely, Jeddah, Makkah, Madinah, F I G U R E 1 Map of Saudi Arabia showing the sampling locations of Aedes aegypti, that is, Jazan, Sahil, Makkah, Jeddah, Madinah and Najran with main road connectivity (left) and the STRUCTURE bars of each population based on K = 2 in STRUCTURE software (Pritchard et al., 2000).Colours within each bar represent each genetic cluster, and the percentage of the colour indicates the percentage of ancestry of each cluster for a particular individual in this study (right).The website ArcGIS (https:// www.arcgis.com/ index.html) was used to generate the map.
Jazan, Sahil and Najran, were included in this study.More information about each region is referred to in Mashlawi et al. (2022).We refer to Makkah, Jeddah and Madinah as 'western' in this study.The Sahil region (stretching ~400 km along the Red Sea coast, linking Jazan with Makkah and Jeddah) was used to evaluate spatial connectivity, as it represents a major concern in terms of passive dispersal due to the high level of use of the international road from Yemen to Makkah through Jazan and Sahil.The Najran region (17°32′ N 44°13′ E) also shares a border with Yemen to the south and is known for an ancient Christian settlement around the 7th century (Frankfurter, 1998), which means there has been a long-term movement of people into this region of Saudi Arabia.To investigate the evolutionary history and test the hypothesis of genetic admixture within Saudi Arabia of the ancestral African and pantropical global forms, population samples from Thailand and Uganda were also included to represent these potential source populations (Table 1).Use of a single population to represent each form is reasonable given that there is far greater genetic differentiation between the forms than there is between populations within the forms (Elnour et al., 2022;Gloria-Soria et al., 2016).

| Mosquito collections
Samples of A. aegypti eggs, larvae and pupae were collected from six regions (four administrative areas) in Saudi Arabia between 2019 and 2022 during the wet season (November to February).The samples were collected from different sites including air-conditioning water containers and disposable plastic containers (Figure 2).The Thailand samples were collected in 2017/18, and the samples from Uganda were collected in 2020.
The samples were collected from 11 locations in Saudi Arabia which were a mix of urban, semi-rural and rural areas (villages) (Table 1).The maximum distance between sampling locations within a location ranged from 22 to 58 km.Mosquitoes were sampled using a larval dipper from a total of 103 collection sites (detailed in Table 1 and Figure 1).All immature stages were maintained at the Centre for Disease Control and Prevention, Ministry of Health, Sabya, at a temperature of 28 ± 2°C and relative humidity of 75 ± 10%, as previously described (Mashlawi et al., 2020).Collected samples were identified morphologically as A. aegypti using a mosquito taxonomic key (Rueda, 2004).During this process, the presence of two forms of A. aegypti (paler and dark) in Saudi Arabia was noted as previously reported by Mattingly and Knight (1956) (Additional file 1: Figure S1).
The dark form was only found in the Jazan highland region.The samples were preserved in tubes with silica gel and transferred to the University of Manchester, UK, for molecular work.

| DNA extraction and microsatellite genotyping of A. aegypti populations
Genomic DNA was extracted using the DNeasy Blood and Tissue Kit (QIAGEN Sciences, Germantown, MD, USA).As full siblings are expected in larval containers (Schmidt et al., 2018), which can bias analyses of population structure (Goldberg & Waits, 2010), we genotyped a maximum of 2-3 individuals from small containers.In Madinah and Najran, there were fewer, but larger, containers so up to 5 individuals were genotyped from each container to obtain adequate population sample sizes.Relatedness among individuals from each location was tested using the maximum-likelihood method in ML-RELATE (Kalinowski et al., 2006).
A total of 389 individual mosquitoes were genotyped for 17 microsatellite markers according to established protocols (Brown et al., 2011;Slotman et al., 2007) (Additional file 1: Table S1).Although previous studies used 10-12 loci, increasing the number of loci (and particularly having more variable loci) is likely to increase the power of population genetic inferences compared to increasing the number of individuals (Landguth et al., 2012).The 17 loci were amplified using three multiplex PCR assays developed in this study (Set 1: B2, AG1, AC4, AG2, CT2 and A9; Set 2: AC5, B3, A1, AG5, AC2 and AC1; and Set 3: AC7, AG3, AG4, AG7 and AT1).Each reaction consisted of GoTaq G2 Colorless Master Mix 12.5 μL and 2.5 μL of 10X primer mix (2 μM each), 1 μL of DNA template (diluted 1:2) and 9 μL of sterile water to a total volume of 25 μL.PCR cycling conditions were as follows: an initial denaturation step at 95°C for 5 min, followed by 32 cycles of 95°C for 30 s, 60°C for 90 s and 72°C for 30 s with a 30 min final extension step at 60°C.For a few samples that did not show a complete profile, a repeat single-plex reaction was performed.The genotyping was performed at Eurofins Genomics, Germany, on an ABI 3130xl (Eurofins Genomics).All PCR products were diluted with RNase-free water at 1:300 dilution, and 1 μL of the diluted PCR product was mixed with 10 μL Hi-Di formamide and 0.15 μL of LIZ-500 internal size standard for loading onto the machine.The results were scored using GeneMapper 5.0 software (Applied Biosystems).

| Genetic analysis and population structure
Micro-Checker v2.2.3 (Van Oosterhout et al., 2004) was used to estimate the prevalence of microsatellite null alleles.The linkage disequilibrium (LD) tests were estimated among all possible pairs of the 17 loci in each population for a total of 13 populations using GENEPOP v4.2 (Rousset, 2008) (dememorization: 10,000; batches: 100; and iterations per batch: 10,000).In Excel calculator v1.2 (Gaetano, 2013), LD significance levels for multiple testing were corrected using the Holm-Bonferroni sequential correction to adjust the p-value to minimize type I error.
Two separate analyses for STRUCTURE were carried out: the first comprised 11 populations from only Saudi Arabia, and the second contained 13 populations from all of Saudi Arabia, Uganda and Thailand (Table 1).The genetic clusters and potential ancestry were identified using the Bayesian assignment algorithm implemented in STRUCTURE software v.2.3 (Pritchard et al., 2000).Twenty independent iterations were run with assumed populations ranging from K = 1 to K = 11 in the Saudi Arabia analysis and K = 1 to K = 13 when Uganda and Thailand were added.The calculation model for both the 11 and 13 population analyses was set as admixture ancestry with 100,000 burn-in steps with 1,000,000 MCMC replicates.Following this, the optimum K value was estimated using Evanno's delta K model (Evanno et al., 2005) in STRUCTURE Harvester (https:// taylo r0.biolo gy.ucla.edu/ struct_ harve st/ ).For better visualization of clustering plots, clumpak (http:// clump ak.tau.ac.il/ index.html) was used with the LargeKGreedy algorithm search method.
For complementary estimations of clustering, principal component analysis (PCA) and discriminant analysis of principal components (DAPC) were conducted in R using the Adegenet package (Jombart, 2008).DAPC grouping is obtained by maximizing the differences between the given populations.
To characterize gene flow (Nm, the number of migrants per generation) between groups, divMigrate-online was used to draw a network depicting estimated gene flow values among populations with the setting of α = 0.05; however, due to the small sample size in some populations, bootstraps were set to zero (https:// popgen.shiny apps.io/ divMi grate -online/ ) (Sundqvist et al., 2016).This analysis was conducted for Saudi populations alone and Saudi populations together with Uganda and Thailand.

| Admixture, history and demographic analyses
To test our hypothesis on colonization events and whether the genetic composition of A. aegypti in Saudi Arabia derives from introductions from both African and non-African populations, we used approximate Bayesian computation (ABC) methods in DIYABC 2.1.0(Cornuet et al., 2014).Population samples from Thailand and Uganda were used to represent global non-African and African groupings, respectively, in these tests of genetic admixture.The DIYABC program tested multiple scenarios and the best-supported scenario was chosen based on the highest posterior probability (p) (Cornuet et al., 2014).We tested three evolutionary scenarios.In

| Marker analysis and genetic diversity
The relatedness analysis (maximum-likelihood method in ML-RELATE) estimated the percentage of first-degree relatives (sib-sib or parent-offspring) to be very low in most population samples, from 0.7% in Jazan to 2.1% in Jeddah.The percentage of first-degree relatives was higher in Najran and Madinah (5.8% and 6.7%, respectively) (Additional file 1: Table S2).This is likely due to sampling more larvae per container but also possible accurately reflects the smaller sampling area of these populations.Since even these values of relatedness are low, they are expected to minimally affect estimates of genetic population structure and gene flow.
A total of 337 A. aegypti from 11 populations in Saudi Arabia and 52 A. aegypti from populations in Uganda and Thailand were characterized (Table 1).The results revealed a total of 226 alleles across the 17 genetic loci in the Saudi Arabia populations.The number of alleles was highest (174) in the Jazan highland region and lowest in Najran (80).The microsatellite marker AG2 showed the highest number of alleles (36), while the AG1 marker revealed the lowest (6) (Additional file 1: Table S3).An average of 13.29 alleles per locus was observed in the Saudi Arabia populations.All 17 genetic loci were polymorphic in all populations, except for AC2 and CT2 which were monomorphic in one population (SahilE, n = 5).
There was a low percentage of null alleles in the data (0.00 to 0.20) across all loci (Additional file 1: Table S4).Since the presence of null alleles at frequencies lower than 0.20 does not influence estimates of genetic differentiation (Chapuis & Estoup, 2007;Gloria-Soria, 2022;Wei et al., 2019), all loci were included in subsequent analysis.Only one pair of loci, AG1 and AG3, showed significant linkage disequilibrium across all the Saudi Arabian and the Thailand populations but not the Ugandan samples after Holm-Bonferroni sequential correction (Additional file 2).Brown et al. (2011) also found linkage disequilibrium for the two loci (AG1 and AG3) so removed one locus but this made no impact on the overall results (Brown et al., 2011).Both loci were retained by Rasheed et al., 2013 who detected no linkage disequilibrium and no physical linkage was reported between them when the markers were originally isolated (Slotman et al., 2007).We observed no substantial differences in our analyses when using both loci or when retaining only one, so we present analyses retaining both loci.
A summary of the genetic diversity over loci for each population is shown in

| Isolation by distance (IBD) within Saudi Arabia
Results of the Mantel test revealed no significant correlation between genetic and geographical distance (r 2 = 0.007, p = 0.53)

| Population structure and clustering analysis in Saudi Arabia
The analysis of molecular variance (AMOVA) results showed a very low genetic population structure with a percentage of variation among populations of only 4.87% (Table 3).This increased to 11.57% among individuals, and the highest percentage of genetic variation was observed at the individual level (83.56%) (Table 3).
The pairwise genetic differentiation (F ST ) matrix of the 11 populations of A. aegypti in Saudi Arabia is shown in Table 4. Pairwise F ST estimates were low, ranging between 0.014 and 0.161, but most of the F ST values were significant.Bayesian clustering analysis using STRUCTURE (Pritchard et al., 2000) identified K = 2 as the optimal number of genetic clusters for the 11 populations in Saudi Arabia (Figure 3a).This was supported by an extremely high delta K value which changed almost to zero at values of K = 3 onwards, indicating two strongly differentiated genetic groupings with little genetic structuring beyond this.The Jazan highland population was genetically distinct as one group (orange colour, Figure 3a), while the western regions (Makkah, Jeddah and Madinah) and Najran comprised the second group (blue colour, Figure 3a).The Sahil and lowland of Jazan show admixture of genetically distinct clusters (Figure 3a).The optimal number of clusters identified by STRUCTURE remained at two, again with extremely strong support from the delta K values, even when the Thailand and Ugandan populations were included in the analysis (Figure 3b).As Thailand and Uganda were completely distinct from each other and each

| Gene flow (Nm) network
A high migration rate was detected within the western regions (between Makkah, Jeddah and Madinah) as well as between the Jazan lowland and Sahil regions (Figure 5a).Jazan highland and Najran were both genetically isolated from other populations.When Uganda and Thailand were included, the network revealed some connectivity of Uganda with the Jazan highland and some connectivity of Thailand with the western region, albeit low in both cases.The strength of gene flow within the western region, and between Sahil and Jazan, remained high (Figure 5b).

| Demographic analysis and population history
Genetic admixture was observed in Sahil and some of the Jazan collections (Figure 3a); therefore, evolutionary scenarios were tested three times using different populations for the Saudi Arabia regions.First, when the evolutionary scenarios considered all populations of Saudi Arabia as representatives of Saudi Arabian A. aegypti, there was extremely strong support for genetic admixture involving African and non-African populations (posterior probability, p = 0.9637) (Figure 6a).Second, when the analysis  was run using only the Jazan highland population (Figure 6b), the admixture scenario was still supported but the posterior support probability was lower (p = 0.8041, Figure 6b).This was due to significant support being given to the model of Jazan having descended from African populations only (p = 0.1926, Figure 6b).
Third, when the evolutionary scenarios included only the western  S5.

| DISCUSS ION
The present study significantly expands upon previous studies of A.
aegypti genetic population structure in Saudi Arabia that used mitochondrial and ribosomal DNA markers (Khater et al., 2021) by using 17 microsatellite markers and by covering a wider geographic region.
This study covers all six geographic regions where dengue has been reported in Saudi Arabia: Jazan, Sahil, Jeddah, Makkah, Madinah and Najran (Ministry of Health, 2016).Further, using comparisons with an African and a Southeast Asian population to represent the ancestral form of A. aegypti from Africa and the pantropical domestic form, respectively, enabled us to elucidate the genetic ancestry of A.
aegypti in Saudi Arabia.
The overall level of genetic differentiation among the studied regions within Saudi Arabia was low.This, together with the absence of a correlation between genetic and geographic distance, is consistent with long-distance passive dispersal of A. aegypti in Saudi Arabia.Passive dispersal through human transportation (largely in the form of immature stages or eggs) is commonly inferred in A. aegypti (Carvajal et al., 2020;Fernando et al., 2020;Hlaing et al., 2010;Maffey et al., 2020Maffey et al., , 2022;;Rasheed et al., 2013).This has generally  et al., 2019;Zayed et al., 2017).
In contrast, the STRUCTURE analyses revealed that the Jazan highlands was genetically distinct from other populations in Saudi Arabia having more genetic similarity with Uganda than western populations of Saudi Arabia.The highest genetic diversity of the Saudi Arabian populations was in the Jazan highlands which had genetic diversity as high as that of Uganda.This information is consistent with the ABC analysis which indicates that, although the Jazan highlands population still has a signal of genetic admixture, that much of the ancestry in this region stems from dispersal directly from sub-Saharan Africa.This is further supported by the observation made during collections in the Jazan highlands that mosquitoes from this area were dark in colour as is A. aegypti formosus from African forests.The STRUCTURE and ABC analyses indicated genetic admixture between the two genetic groupings originating from Africa or the global population in the intervening geographic regions of Sahil and the Jazan region.The mixing of these two distinct genetic forms of A. aegypti has also recently been inferred to have occurred in Sudan (Elnour et al., 2022).
The retention of the high genetic diversity of A. aegypti in the Jazan highlands indicates that it is the result of historical migration involving a larger number of founders, potentially repeated migration over a long time period, since there is no evidence of a genetic bottleneck.The African-like population in the Jazan highlands may therefore have been introduced through Yemen as Jazan shares a border with Yemen and is close to some parts of Eastern Africa.Although A. aegypti was first recorded in the Arabian Peninsula area (covering Saudi Arabia and Yemen) in the mid-20th century (Mattingly & Knight, 1956), Yemen is known to have the oldest port in the region (Aden), which traded with Eastern Africa and Southern Asia about 2000-3000 years ago (Martin & Vigne, 1997).Further investigations using Yemeni samples would allow this hypothesis of entry through Aden to be tested.Nonetheless, current shipments from Africa largely use the port of Jeddah (Waters, 2017); therefore, further analyses are needed to determine whether A. aegypti is still entering the Arabian Peninsula and if so via which route(s)?This knowledge is important as it could inform surveillance efforts to detect pathogens that might be imported from Africa.The biological characteristics of the genetically distinct population in the Jazan highlands also require further investigation.In particular, the presence of a unique African-like population in the Jazan highlands might differ in disease transmission ability compared to non-African populations (Dickson et al., 2014;Weetman et al., 2018).Therefore, findings of the present study should be taken into account when designing vector control strategies, like potential future trials of Wolbachia releases for dengue control.

ACK N OWLED G EM ENTS
We are grateful to Dr. Faisal Almathen from King Faisal University all of these, the deepest split is between African and non-African global populations, based on previous reports of genetic structure of global populations (Gloria-Soria et al., 2016).The three scenarios tested were as follows: (1) African populations gave rise to Saudi Arabia populations; (2) populations from global domestic populations outside Africa gave rise to Saudi Arabia populations; and (3) both African and out-of-African populations gave rise to Saudi Arabia populations (admixture scenario).Each evolutionary scenario was tested three times using a different composition for the Saudi Arabia populations (based on the STRUCTURE plot), that is, (A) all 11 populations in Saudi Arabia, (B) Jazan highlands, and (C) Jeddah and Makkah.The (B) and (C) were selected to represent the divergent groupings within Saudi Arabia detected by the STRUCTURE analysis.

F
Field photos and examples of potential larval habitat for Aedes aegypti in Saudi Arabia.(a, i, j) Air-conditioning water containers, (b, c, e, m) animals drinking containers, (d) rainwater in rock hole, (f) water coolers in mosques and neighbourhoods, (g) water coolers inside houses, (h) construction containers, (k) discarded car tire and (l) disposable plastic container.
337) in Saudi Arabia (Additional file 1: FigureS2).No correlation was observed when the Jazan highlands or Najran populations were excluded.When Mantel tests were performed at the local scale, within each population, a significant positive but low correlation (r 2 = 0.225, p < 0.0001) was observed in the Najran population (n = 32) but not in any other population (Additional file 1: FigureS2). FigureS4a,b).

F
I G U R E 3 Population structure analysis of Aedes aegypti based on 17 microsatellite loci using STRUCTURE from (a) eleven populations from Saudi Arabia (K = 2) and (b) eleven populations from Saudi Arabia, one population from Thailand and one population from Uganda (K = 2).Each colour indicates a distinct genetic grouping, and each vertical line represents an individual with the proportion of population ancestry from each genetic grouping in that individual indicated by the height of the colour.To the left of the STRUCTURE plots is the graph showing estimation of the optimal K value according to the Evanno et al. (2005) method, with the optimal estimated value indicated by * next to the structure plot.

F
I G U R E 4The discriminant analysis of principal component (DAPC) clustering analysis based on 17 microsatellite loci for Aedes aegypti populations from (a) Saudi Arabia (11 populations) and (b) Saudi Arabia, Thailand and Uganda (13 populations) using Adegenet(Jombart, 2008).Each clustering colour corresponds to a population, and dots represent individuals.A bar plot of the discriminant analysis eigenvalues corresponds to the variance ratio between groups.Sahil_I = SahilW; Sahil_II = SahilM; Sahil_III = SahilE; Sahil_IV = SahilA.F I G U R E 5 Network of directed migration routes using divMigrate with α = 0.05.Each circle with a code represents a population which is connected with arrows with the numbers representing the migration values, Nm (number of migrants per generation).The weight of the line corresponds to the extent of migration.(a) The gene flow of seven Saudi Arabia study populations; (b) The network of seven Saudi Arabia, one Uganda and one Thailand populations.(Jeddah and Makkah) populations, the best-supported scenario was for a sole origin of the Saudi Arabian populations from the global domestic form represented here by Thailand (p = 0.7062) (Figure 6c).Overall, these data provide strong support for Saudi Arabia populations of A. aegypti being the result of genetic admixture, but they also indicate that admixture proportions vary geographically across Saudi Arabia with the Jazan highland having a greater proportion of genetic ancestry from Africa and the western regions having a great proportion from non-African populations.Details on the posterior probabilities and split time between regions used as priors for the ABC analysis and inferred for the supported scenarios are provided in Additional file 1: Table

F
Three evolutionary scenarios used to test alternative hypotheses of colonization and admixture of Aedes aegypti in Saudi Arabia using DIYABC software of Cornuet et al. (2014) using microsatellite data from Saudi Arabia (SA) populations, the Thailand (Thai) population to represent the global domestic form and Uganda (UG) to represent the ancestral African form.From left to right in each panel, the evolutionary scenarios are as follows: colonization of SA by the ancestral African form only; colonization of SA by the global domestic form only; and colonization of SA by both forms with admixture.This set of three evolutionary scenarios was tested using different populations from Saudi Arabia: (a) all populations from Saudi Arabia (SA); (b) Jazan highlands (JazH) only; and (c) Jeddah and Makkah only (JED+MAK).
been attributed to facilitation by human movement, particularly by road.The volume of automobiles moving through the Sahil region (from Jazan to Makkah and Jeddah) is high which corresponds to the particularly high levels of migration of A. aegypti inferred to occur between these regions.Conversely, the low road connectivity towards Najran from the other locations studied here may be responsible for the slightly higher genetic isolation of Najran.Overall, the patterns of genetic differentiation and migration detected here and their correlation with road connectivity are consistent with transportation networks facilitating the passive dispersal of A. aegypti in Saudi Arabia (Figure1).This could facilitate the spread of pathogens and insecticide resistance to new locations in the country, that is, north or east.The study of the population structure of A. aegypti in Saudi Arabia revealed that there are two main genetic groupings within Saudi Arabia with extensive genetic admixture in some locations.The similarity of these genetic groupings to ancestral African populations (represented by Uganda) or the global domestic form (represented by the Thailand form) together with the ABC analysis indicates that Saudi Arabia has been colonized by A. Aegypti from Africa as well as by the pantropical domestic form.Clustering of the western populations with Thailand in the present study is consistent with a previous global diversity study(Gloria-Soria et al., 2016), where Jeddah was clustered with Pakistan or the New World.These populations in Saudi Arabia also had low genetic diversity similar to that observed in the Thailand population.Together, this indicates that A. aegypti in this part of Saudi Arabia is indistinguishable from global domestic populations of A. aegypti.This might be explained by the large amount of international shipping coming into Jeddah as well as the popularity of Makkah.Every year, more than two and a half million people gather in Makkah (Mecca) from over 180 nations across the world (particularly Southeast Asia) to practise their religion (Hajj and Umrah), arriving through the international airport in Jeddah (Haridi

and
Paing Soe and Franziska Elsner-Gearing from University of Manchester, for helpful discussions with analysis.The authors extend their appreciation to the Deputyship for Research & Innovation, Ministry of Education in Saudi Arabia for funding this research work through the project number ISP-2024.This research was carried out with financial support from Jazan University, Saudi Arabia, and Saudi Arabia Cultural Bureau in London (SACB) PhD Studentship (Fund: AM) and Research England QR GCRF allocation to the University of Manchester.We also thank the Office of Research Administration, TA B L E 1 Country, collection regions, site names (code), number of samples, geographical coordinates and collection date of Aedes aegypti used in the present study.

Table 2
Pairwise F ST matrix estimates for Aedes aegypti populations in Saudi Arabia.
Note: F ST estimates in bold have p-values ranging from 0.009 to 0.035.